Search CORE

224 research outputs found

Adaptive Latency Insensitive Protocols andElastic Circuits with Early Evaluation: A Comparative Analysis

Author: Carloni
Casu
Luca Macchiarulo
Mario R. Casu
Murata
Publication venue: Elsevier
Publication date: 01/01/2009
Field of study

AbstractLatency Insensitive Protocols (LIP) and Elastic Circuits (EC) solve the same problem of rendering a design tolerant to additional latencies caused by wires or computational elements. They are performance-limited by a firing semantics that enforces coherency through a lazy evaluation rule: Computation is enabled if all inputs to a block are simultaneously available. Adaptive LIP's (ALIP) and EC with early evaluation (ECEE) increase the performance by relaxing the evaluation rule: Computation is enabled as soon as the subset of inputs needed at a given time is available. Their difference in terms of implementation and behavior in selected cases justifies the need for the comparative analysis reported in this paper. Results have been obtained through simple examples, a single representative case-study already used in the context of both LIP's and EC and through extensive simulations over a suite of benchmarks

Elsevier - Publisher Connector

Crossref

PORTO@iris (Publications Open Repository TOrino - Politecnico di Torino)

PORTO Publications Open Repository TOrino

Recommended from our members

The Benefits of Using Clock Gating in the Design of Networks-on-Chip

Author: Carloni Luca
Petracca Michele
Publication venue: 'Columbia University Libraries/Information Services'
Publication date: 01/01/2011
Field of study

Networks-on-chip (NoC) are critical to the design of complex multi-core system-on-chip (SoC) architectures. Since SoCs are characterized by a combination of high performance requirements and stringent energy constraints, NoCs must be realized with low-power design techniques. Since the use of semicustom design flow based on standard-cell technology libraries is essential to cope with the SoC design complexity challenges under tight time-to-market constraints, NoC must be implemented using logic synthesis. In this paper we analyze the major power reduction that clock gating can deliver when applied to the synthesis of a NoC in the context of a semi-custom automated design flow

Columbia University Academic Commons

Recommended from our members

Topology-Based Performance Analysis and Optimization of Latency-Insensitive Systems

Author: Carloni Luca
Collins Rebecca L.
Publication venue: 'Columbia University Libraries/Information Services'
Publication date: 01/01/2008
Field of study

Latency-insensitive protocols allow system-on-chip engineers to decouple the design of the computing cores from the design of the inter-core communication channels while following the synchronous design paradigm. In a latency-insensitive system (LIS) each core is encapsulated within a shell, a synthesized interface module that dynamically controls its operation. At each clock period, if new data has not arrived on an input channel or a stalling request has arrived on an output channel, the shell stalls the core and buffers other incoming valid data for future processing. The combination of finite buffers and backpressure from stalling can cause throughput degradation. Previous works addressed this problem by increasing buffer space to reduce the backpressure requests or inserting extra buffering to balance the channel latency around a LIS. We explore the theoretical complexity of these approaches and propose a heuristic algorithm for efficient queue sizing. We also practically characterize several LIS topologies and how the topology of a LIS can impact not only how much throughput degradation will occur, but also the difficulty of finding optimal queue sizing solutions

Columbia University Academic Commons

Recommended from our members

Using Functional Independence Conditions to Optimize the Performance of Latency-Insensitive Systems

Author: Carloni Luca
Li Cheng-Hong
Publication venue: 'Columbia University Libraries/Information Services'
Publication date: 01/01/2007
Field of study

In latency-insensitive design shell modules are used to encapsulate system components (pearls) in order to interface them with the given latency-insensitive protocol and dynamically control their operations. In particular, a shell stalls a pearl whenever new valid data are not available on its input channels. We study how functional independence conditions (FIC) can be applied to the performance optimization of a latency-insensitive system by avoiding unnecessary stalling of their pearls. We present a novel circuit design of a generic shell template that can exploit FICs. We also provide an automatic procedure for the logic synthesis of a shell instance that is only based on the particular local characteristics of its corresponding pearl and does not require any input from the designers. We conclude reporting on a set of experimental results that illustrate the beneits and overhead of the proposed technique

Columbia University Academic Commons

Recommended from our members

Leveraging Local Intra-Core Information to Increase Global Performance in Block-Based Design of Systems-on-Chip

Author: Carloni Luca
Li Cheng-Hong
Publication venue: 'Columbia University Libraries/Information Services'
Publication date: 01/01/2008
Field of study

Latency-insensitive design is a methodology for system-on-chip (SoC) design that simplifies the reuse of intellectual property cores and the implementation of the communication among them. This simplification is based on a system-level protocol that decouples the intra-core logic design from the design of the inter-core communication channels. Each core is encapsulated within a shell, a synthesized logic block that dynamically controls its operation to interface it with the rest of the SoC and to absorb any latency variations on its I/O signals. In particular, a shell stalls a core whenever new valid data are not available on the input channels or a down-link core has requested a delay in the data production on the output channels. We study how knowledge about the internal logic structure of a core can be applied to the design of its shell to improve the overall system-level performance by avoiding unnecessary local stalling. We introduce the notion of functional independence conditions (FIC) and present a novel circuit design of a generic shell template that can leverage FIC. We propose a procedure for the logic synthesis of a FIC-shell instance that is only based on the analysis of the intra-core logic and does not require any input from the designers. Finally, we present a comprehensive experimental analysis that shows the performance benefits and limited design overhead of the proposed technique. This includes the semi-custom design of an SoC, an ultra-wideband baseband transmitter, using a 90nm industrial standard cell library

Columbia University Academic Commons

Flexible Filters: Load Balancing through Backpressure for Stream Programs

Author: Carloni Luca
Collins Rebecca L.
Publication venue: 'Columbia University Libraries/Information Services'
Publication date: 01/01/2009
Field of study

Stream processing is a promising paradigm for programming multi-core systems for high-performance embedded applications. We propose flexible filters as a technique that combines static mapping of the stream program tasks with dynamic load balancing of their execution. The goal is to improve the system-level processing throughput of the program when it is executed on a distributed-memory multi-core system as well as the local (core-level) memory utilization. Our technique is distributed and scalable because it is based on point-to-point handshake signals exchanged between neighboring cores. Load balancing with flexible filters can be applied to stream applications that present large dynamic variations in the computational load of their tasks and the dimension of the stream data tokens. In order to demonstrate the practicality of our technique, we present the performance improvements for the case study of a JPEG encoder running on the IBM Cell multi-core processor

CiteSeerX

Columbia University Academic Commons

Recommended from our members

A Recursive Data-Driven Approach to Programming Multicore Systems

Author: Carloni Luca
Collins Rebecca L.
Publication venue: 'Columbia University Libraries/Information Services'
Publication date: 01/01/2007
Field of study

In this paper, we propose a method to program divide-and-conquer problems on multicore systems that is based on a data-driven recursive programming model. Data intensive programs are difficult to program on multicore architectures because they require efficient utilization of inter-core communication. Models for programming multicore systems available today generally lack the ability to automatically extract concurrency from a sequential style program and map concurrent tasks to efficiently leverage data and temporal locality. For divide-and-conquer algorithms, a recursive programming model can address both of these problems. Furthermore, since a recursive function has the same behavior patterns at all granularities of a problem, the same recursive model can be used to implement a multicore program at all of its levels: 1. the operations of a single core, 2. how to distribute tasks among several cores, and 3. in what order to schedule tasks on a multicore system when it is not possible to schedule all of the tasks at the same time. We present a novel selective execution technique that can enable automatic parallelization and task mapping of a recursive program onto a multicore system. To verify the practicality of this approach, we perform a case-study of bitonic sort on the Cell BE processor

Columbia University Academic Commons

Torsion gravity with non-minimally coupled fermionic field: some cosmological models

Author: Carloni Sante
Fabbri Luca
Vignolo Stefano
Publication venue: 'American Physical Society (APS)'
Publication date: 01/01/2015
Field of study

We investigate some cosmological models arising from a non-minimal coupling of a fermionic field to gravity in the geometrical setting of Einstein-Cartan-Sciama-Kibble gravity. The role played by the non-minimal coupling together with torsion in facing problems such as cosmological singularity, inflation and dark energy is discussed.Comment: 15 pages, no figures, updated and extended versio

arXiv.org e-Print Archive

Archivio istituzionale della ricerca - Università di Genova

The dynamics of Bianchi I universes in $R^n$ cosmologies with torsion

Author: Carloni Sante
Fabbri Luca
Vignolo Stefano
Publication venue: 'IOP Publishing'
Publication date: 01/01/2013
Field of study

We analyze the phase space of Bianchi I cosmologies filled by a spin fluid in the framework of

f(R)

-gravity with torsion using a combination of the dynamical systems approach and the 1+3 covariant formalism. In the simple case of

f(R)=R^n

our results allow a quantification of the role of torsion and the spin of the cosmic fluid in the evolution of the cosmology. While torsion is able to modify the cosmological dynamics with respect to the purely metric case, the spin has little influence on the cosmology. We argue that this is due to the different symmetries of the tensor characterizing the anisotropies and the spin tensor. The cosmological model we analyzed presents isotropization for a wide set of initial conditions and values of the parameters and allows for two types of exotic bounce solutions.Comment: 22 page, 6 figures, submitted to Classical and Quantum Gravit

arXiv.org e-Print Archive

Archivio istituzionale della ricerca - Università di Genova